Methods and kits for propagating and evolving nucleic acids and proteins
FIELD OF THE INVENTION 

The invention relates to the field of directed molecular evolution and to the field of gene 
silencing. More specifically, the invention describes the use of the erroneous nature of
RNA virus replication for engineering nucleic acids and proteins with advantageous
properties. This invention relates also to the use of RNA viruses and other RNA replicons
for providing target nucleic acid sequences in the form of double-stranded RNA, suitable 
for inducing sequence-specific gene silencing effects in eukaryotic organisms. 

BACKGROUND OF THE INVENTION 

Proteins and nucleic acids are essential for the functioning of all biological systems. In
addition, many proteins are of considerable importance for industry, medicine,
agriculture, bioremediation, and other applications. The potential utility of nucleic acid-based
enzymes, such as ribozymes, and binding molecules has also been discussed (Burgstaller
et al., 2002; Cobaleda and Sanchez-Garcia, 2001; de Feyter and Li, 2000; Pohorille and
Deamer, 2002; Robertson and Ellington, 2001; White et al., 2001). Practical applications
often require properties that are irrelevant or even harmful for living organisms. As a
consequence, the use of natural enzymes in industry can be limited by inefficient catalysis
of non-natural substrates, low stability, low tolerance for changes in operating parameters,
poor activity in non-aqueous solutions, or requirements for expensive cofactors (Farinas et
al., 2001; Petrounia and Arnold, 2000). Similarly, antibodies obtained from immunized
animals may not be adequate for diagnostic and therapeutic purposes due to low affinity,
cross-reactivity, immuno-incompatibility, and other problems (Carter, 2001; Hudson and
Souriau, 2003; Winter and Harris, 1993).

Two major strategies have been employed to improve protein performance: rational design 
and directed evolution (Arnold, 2001; Bornscheuer and Pohl, 2001). The first strategy can
only be applied to proteins with known three-dimensional structures and remains
challenging for practical use (Altamirano et al., 2000; Nixon et al., 1999; Quemeneur
et al., 1998). In contrast, directed evolution has become a popular approach to protein
engineering and, furthermore, has been employed for selecting nucleic acid molecules with
various biological activities (Farinas et al., 2001; Petrounia and Arnold, 2000).

All directed evolution protocols rely on a simple Darwinian optimization algorithm
comprising the steps of diversification and selection. First, diversity is created within the
population of target molecules. This is followed by selection that reveals improved variants
that can be used as such or subjected to further rounds of evolution. Two distinct selection
procedures have been used, bona fide selection and screening (see e.g. Soumillion and
Fastrez, 2001). Bona fide selection is based on survival or better propagation of the fittest
variants of target molecules under selective conditions, which is conceptually similar to
natural selection as understood in the theory of evolution. For the sake of simplicity, bona
fide selection will be referred to hereafter as "selection". The term "screening" refers to manual
or automated picking of preferred variants from a population of target molecules. This
procedure can be likened to artificial selection in Darwin's theory.

In the case of iterative diversification-selection rounds, the population of target molecules 
has to be occasionally replenished. Short peptides and oligonucleotides with predetermined
sequence can be multiplied using chemical synthesis. In a more general case, RNA or
DNA molecules are reproduced in vivo or in vitro through the template-copy mechanism
according to the base complementarity rules. Proteins are commonly produced by
translation of RNA templates (mRNAs) in either living cells (e.g. in phage, LacI, and
cell-surface displays) (Chen and Georgiou, 2002; O'Neil and Hoess, 1995; Rader and Barbas,
1997; Schatz et al., 1996; Wittrup, 2001), or cell-free extracts (e.g. in mRNA display,
different versions of ribosome display, and sorting in man-made compartments) (Amstutz
et al., 2001; Tawfik and Griffiths, 1998). To ensure an adequate selection, proteins having
desired properties (phenotype) have to be linked with the cognate nucleic acids (genotype).

A specialist in the field of directed evolution would recognize two major challenges in the
relevant art. First, sufficiently large libraries of target molecules have to be constructed and
searched for advantageous variants. Second, numerous directed evolution techniques allow
for selecting improved binding activities, whereas only a limited number of protocols can be
used to alter enzymatic properties of target molecules.

With regard to the first challenge, diversification of target molecules is usually achieved 
using mutagenesis and/or recombination. Error-prone PCR and synthetic oligonucleotide-based
techniques, such as cassette mutagenesis, have been methods of choice for
diversifying nucleic acid populations in vitro (Trower, 1996). Similarly, in vitro
recombination procedures have been described, including gene shuffling, exon shuffling,
and nonhomologous random recombination (Bittker et al., 2002a; Coco et al., 2001;
Kolkman and Stemmer, 2001; Kurtzman et al., 2001; Stemmer, 1994a; Stemmer, 1994b).
Because of the heavy use of PCR, DNA fragmentation, gel purification, DNA ligation, and
other in vitro techniques, most of the above methods require the expertise of highly skilled
technicians and can be time-consuming or resource-intensive. If the selection/screening
strategy is straightforward, the steps of mutagenesis/recombination in vitro may account
for nearly all the time and effort spent on a directed evolution project.

Notably, several in vivo mutagenesis approaches have been described, examples including
the use of mutator strains and enhancing mutation rates in wild-type cells by chemicals or
radiation (Long-McGie et al., 2000; Selifonova et al., 2001; Trower, 1996). These
techniques rely on culturing cells, normally bacteria, and therefore do not involve
substantial expenses or extensive personnel training.

However, a broader utility of mutator strains and condition-induced mutagenesis is 
hampered by the indiscriminate nature of mutations, which affect both target sequences 
and the host cell genome with a probability directly proportional to the nucleic acid
length. Because cellular genomes comprise a number of indispensable genes and are
several orders of magnitude larger than usual directed evolution targets, the maximal
allowed mutation rate is limited by the host tolerance. As a consequence, only moderate
mutation rates are available to an artisan willing to modify a protein or a nucleic acid,
which necessitates the use of large pools of cells and/or extended mutagenesis times.
Furthermore, if the search for improved variants is based on cell survival, growth rate
or morphology, advantageous mutations in the target sequence may be masked by
disadvantageous changes in the genetic background of the host, thus reducing the
efficiency and accuracy of the selection/screening procedure.
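
The length argument above can be illustrated with a short calculation. The following Python sketch is given by way of illustration only: the function names, the tolerated mutation load, and the genome and target lengths are hypothetical values chosen for the example and are not data from this disclosure. It assumes that every nucleotide is hit with equal probability, so that the expected number of mutations per copy equals the per-nucleotide rate multiplied by the length.

```python
def expected_mutations(rate_per_nt, length_nt):
    """Expected number of mutations per copy, assuming a uniform per-nucleotide rate."""
    return rate_per_nt * length_nt


def max_tolerated_rate(tolerated_mutations_per_genome, genome_length_nt):
    """Highest per-nucleotide rate that keeps the genome-wide load within tolerance."""
    return tolerated_mutations_per_genome / genome_length_nt


# Hypothetical numbers, for illustration only.
TOLERATED_LOAD = 1.0          # mutations per genome copy that the replicating entity tolerates
HOST_GENOME_NT = 6_000_000    # a cellular genome of several megabases
SMALL_GENOME_NT = 15_000      # a much smaller genome carrying the same target
TARGET_NT = 1_000             # a directed evolution target of about one kilobase

for name, genome_nt in [("large genome", HOST_GENOME_NT), ("small genome", SMALL_GENOME_NT)]:
    rate = max_tolerated_rate(TOLERATED_LOAD, genome_nt)
    hits = expected_mutations(rate, TARGET_NT)
    print(f"{name}: max rate {rate:.2e} per nt, {hits:.2e} target mutations per copy")
```

Under these assumed values, the number of target mutations obtainable per copy scales inversely with the length of the genome that must tolerate the mutagenesis.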

Concerning the second challenge for the art of directed evolution, many methods, such as 
phage displays, ribosomal displays, cell-surface displays, mRNA display, SELEX, and
others, utilize a conceptually simple binding procedure to select for proteins or nucleic acids
with improved affinities to a given ligand. In contrast, only a few techniques have been
reported for changing enzyme properties. There are reports where phage display and
SELEX technologies have been adapted for evolving some enzymatic activities; however,
the range of catalytic reactions which can be selected for is limited (see e.g. Forrer et al.,
1999; Wilson and Szostak, 1999). Similarly, the in vitro compartmentalization method,
developed by Griffiths et al. for evolving nucleic acid modification enzymes (Tawfik and
Griffiths, 1998), requires elaborate in vitro manipulations when applied to other types of
enzymes (Griffiths and Tawfik, 2003).

Expressing target genes in bacteria and screening/selecting for desired enzymatic activities 
is one of the most versatile approaches for evolving enzymes with improved properties 
(e.g. Cohen et al., 2001). The use of mutagenesis in vivo is extremely advantageous for
this group of methods, because (in addition to the aforementioned problems of in vitro
diversification techniques) the efficient delivery of large nucleic acid libraries into living
cells constitutes a major methodological challenge.

Since existing methods of mutagenesis in vivo also suffer from serious limitations, there is a
great need for a rapid, non-laborious, inexpensive method for generating diverse
populations of target molecules in vivo, which could be used for changing enzyme
properties in a required fashion. Toward this end, the present invention discloses the use of 
the erroneous nature of RNA-dependent nucleic acid synthesis for the purpose of directed 
evolution. 

Although related primarily to the field of directed molecular evolution, the present 
invention furthermore provides a novel strategy for generating double-stranded (ds) RNA
triggers suitable for inducing sequence-specific gene silencing effects in eukaryotes. A
comprehensive description of sequence-specific gene silencing, also referred to as RNA
silencing, can be found elsewhere (Baulcombe, 2002; Cogoni, 2001; Hannon, 2002; Vance
and Vaucheret, 2001). Briefly, RNA silencing is a group of phenomena in which dsRNA
triggers induce sequence-specific downregulation of the expression of target genes in
eukaryotic organisms. The form of RNA silencing where the dsRNA trigger is introduced into
the cell artificially is called RNA interference (RNAi). Several important applications of
RNAi have been reported, ranging from functional genomics to curing disease (Barstead,
2001; Jacque et al., 2002; Kamath et al., 2003; Lum et al., 2003; McCaffrey et al., 2003;
Novina et al., 2002; Pekarik et al., 2003).

In some applications, dsRNA triggers, provided in an isolated form, are administered into
living cells or cell-free extracts to induce gene silencing effects. Accordingly, several in
vitro methods for producing dsRNA of desired sequence have been reported in the prior
art. A large group of such methods comprises the steps of providing two
self-complementary single-stranded (ss) RNAs and annealing these ssRNAs into a duplex.
Alternatively, isolated RNA-dependent RNA polymerases were used to generate dsRNAs
from ssRNA templates in vitro (PCT/FI00/01135; WO 01/46396).

As discussed above, in vitro methods may suffer from several limitations, such as being
expensive and resource-intensive and requiring the skills of highly trained personnel.
Furthermore, the existing in vitro methods for generating dsRNAs can provide only limited
amounts of the product. It is therefore advantageous to develop a method wherein dsRNA
triggers could be produced from a renewable source, such as a living cell, and purified using
a straightforward procedure. Toward this end, the present invention offers a simple and
convenient strategy wherein RNA replicons (such as RNA viruses, RNA virus-like
particles, RNA plasmids, or derivatives thereof) are used to propagate target nucleic acid 
sequences in the form of dsRNA. 

SUMMARY 

In an aspect, this invention utilizes the high mutation rate and adaptability of an RNA-based
biological entity (e.g. virus) as a driving force for directed evolution of target
sequences, in particular heterologous target sequences. Indeed, replication of RNA
genomes is catalyzed by polymerases lacking proofreading function, which makes RNA
copying an intrinsically erroneous process (Domingo et al., 2001). Importantly, the novel
method for directed evolution has a substantially higher theoretical limit for the maximal
allowed mutation rate than the existing methods for mutagenesis in living cells, because
RNA genomes are much smaller than cellular DNA genomes. This enables an accelerated
discovery of improved variants using moderate numbers of the host cells.

Many RNA-based systems can be suitable for practicing the new method of directed
evolution. For the purpose of this invention, it may be advantageous to use an RNA virus.
Both true riboviruses, whose life cycle proceeds entirely on the RNA level, and so-called
reverse-transcribing viruses, which alternate between RNA and DNA genomic forms
throughout their life cycles, are acceptable formats. However, in other embodiments, one
can make use of essentially any RNA-based organism or system, including RNA virus-like
particles, RNA plasmids, viroids, or other RNA-based autonomous genetic elements. A
currently preferred embodiment relies on a genetically altered bacteriophage φ6, a dsRNA
virus from the Cystoviridae family that infects the bacterium Pseudomonas syringae
(Mindich, 1988; Mindich, 1999a). 

The new methods described here are intended primarily for directed evolution of proteins 
and nucleic acids. Specific applications of the method include but are not limited to 
improving enzymes, as well as molecules having specific binding and regulatory activities.
In other embodiments, the method is used for optimizing RNA stability or codon usage.

In an additional aspect, the present invention describes a novel method for inducing 
sequence-specific gene silencing in eukaryotic organisms based on RNA viruses or other 
RNA replicons. In the method, RNA replicons are used as vehicles for propagating target
nucleic acid sequences in a dsRNA form; the dsRNA is purified and used to trigger silencing
effects.

As with the aforementioned methods of directed evolution, a number of biological entities 
having RNA genomes will be appropriate systems for use within this methodology. For
example, at least some ssRNA viruses are known to replicate their genomes via dsRNA
intermediates (Buck, 1996). However, for the ease of obtaining dsRNA of sufficient purity
and in sufficient amounts it is advantageous to use viruses or other types of replicons with
dsRNA genomes. The currently preferred embodiment relies on dsRNA viruses from the
Cystoviridae family, a specific example being bacteriophage φ6.

In yet a further aspect, the invention provides a novel method for constructing recombinant
dsRNA bacteriophages. The method takes advantage of suicide vectors wherein nucleic
acid fragments of interest are operably linked with the sequences sufficient for detectable
replication by the viral replication apparatus. The new method is faster and easier than
previously described methods for constructing recombinant dsRNA bacteriophages, which
involve in vitro packaging of procapsid particles (Poranen et al., 2001) or propagating
genetically modified bacteriophages in host cells stably transformed with the plasmid
expressing target genes (Mindich, 1999b, and references therein).

In the currently preferred embodiment said suicide vector is a DNA plasmid that is
delivered into a cell containing functional viral replication apparatus. The plasmid cannot
be stably propagated within said cell (definition of a suicide vector), but can be transiently
transcribed by a DNA-dependent RNA polymerase to yield RNAs replicable by the viral
polymerase.

Said replicable RNAs derived from the suicide plasmid contain target nucleic acid 
sequence, which makes the suicide vector strategy useful for specific embodiments related 
to directed evolution. Furthermore, because RNAs replicable by dsRNA virus polymerase
in vivo are converted into dsRNA (genomic) form, the use of the suicide vector strategy is
highly advantageous for the embodiments of this invention that are related to inducing
sequence-specific gene silencing effects.

Further features, aspects and advantages of the present invention will be better understood
from the description of specific embodiments and examples. It should be understood,
however, that the description and the examples are given by way of illustration only,
not by way of limitation. Various changes and modifications within the spirit and the
scope of the invention will become apparent to those skilled in the art from the following
text. Furthermore, citation of a reference throughout the entire patent text shall not be
interpreted as an admission that such is prior art to the present invention.

BRIEF DESCRIPTION OF THE FIGURES

The foregoing text, as well as the following description and appended claims, will be better
understood when read in conjunction with the appended figures, in which:

Figure 1 shows schematically how recombinant RNA replicons are generated using the
suicide plasmid strategy. The example depicts constructing carrier-state Pseudomonas
syringae cells that contain recombinant φ6 virus expressing the beta-lactamase gene (φ6-bla).

Figure 2 depicts:

(A) Agarose gel electrophoresis of total RNA from the following strains: K, Km-resistant
HB10Y(φ6-npt); A0, Amp-resistant HB10Y(φ6-bla); HB, non-infected HB10Y. Lane φ6,
dsRNA segments M and S extracted from the wild-type φ6 (positions indicated on the
left along with the positions of P. syringae 23S and 16S rRNAs). Mk, dsDNA markers.
Marker lengths in kbp are shown on the right. White arrowhead shows the new segment,
M-bla, which appears in Amp-resistant cells.
(B) RT-PCR analysis with npt- and bla-specific primers was performed using RNA from:
K, HB10Y(φ6-npt) and A0, HB10Y(φ6-bla). The reverse transcription (RT) step was
omitted in reactions 2 and 5. Different PCR primers were used as specified under the
panel. Positions of the npt- and bla-specific PCR fragments are marked on the right. dsDNA
marker (Mk) lengths are shown on the left.

Figure 3 shows that φ6-bla carrier cells rapidly adapt to cefotaxime.

(A) 0.2 to 1x10^ HB10Y(φ6-bla) carrier state cells were plated onto LB agar containing
either 150 µg/ml ampicillin (Amp150) or 50 µg/ml cefotaxime (Ctx50). Ctx-resistant
colonies appeared after 3 days of incubation at 28°C. No colonies were detected at this
time on the sector inoculated with 1x10^ HB10Y(pLM254) cells, which contain a plasmid
encoding the bla gene.

(B) Schematic diagram of the Ctx adaptation experiment. Cells were cultivated on LB agar
containing increasing Ctx concentrations (µg/ml), as shown below petri dishes. 20-40 of
the largest colonies were pooled after each passage and used for subsequent rounds of
selection.

(C) Upper panel, agarose gel analysis of RNA extracted from carrier state cells at passages
A0, C1, C2, C3, C4, C7 and C10. HB, RNA from uninfected HB10Y cells. Lower panel,
RT-PCR products generated using bla-specific primers. Other designations are as defined
in the description of Fig. 2.

(D) SDS-PAGE analysis (Olkkonen and Bamford, 1989) of carrier state cells from
different passages (A0, C1, C4, C7 and C10) or purified φ6 virus (φ6). HB, uninfected
HB10Y cells. Panel G250, a Coomassie G250 stained gel fragment showing the band of
protein P1. α-P1, α-P2, α-P4, and α-P8, immunoblots produced using antibodies specific
to corresponding φ6 nucleocapsid (NC) proteins and ECL detection as recommended by
Pierce Biotechnology.

(E) Transmission electron micrograph of osmium tetroxide and uranyl acetate stained cell
thin sections from A0 and C10 passages taken as described (Bamford and Mindich, 1980).
Black arrowhead, enveloped virions; white arrowhead, NC and PC particles.

Figure 4 depicts changes in the bla sequence population in response to cefotaxime 
selection. 

Graphs show normalized point mutation frequency at indicated nucleotide positions 
summed for n bla sequences from each passage. Bars corresponding to synonymous 
nucleotide changes are marked with circles. Unmarked bars, missense mutations.
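
A per-position mutation frequency of the kind plotted in Figure 4 can be computed as sketched below. This Python fragment is an illustrative assumption and not the procedure actually used to prepare the figure: the function name is hypothetical, the sequences are assumed to be pre-aligned and of equal length, and "normalized" is taken here to mean division by the number n of sequences, which the text does not specify.

```python
def point_mutation_frequency(reference, sequences):
    """Fraction of sequences that differ from the reference at each position.

    Assumes all sequences are already aligned to the reference and have the
    same length (no insertions or deletions).
    """
    n = len(sequences)
    if n == 0:
        raise ValueError("at least one sequence is required")
    counts = [0] * len(reference)
    for seq in sequences:
        if len(seq) != len(reference):
            raise ValueError("sequences must have the same length as the reference")
        for position, (ref_base, base) in enumerate(zip(reference, seq)):
            if ref_base != base:
                counts[position] += 1
    # Normalize the summed counts by the number of sequences analyzed.
    return [count / n for count in counts]


# Toy example with made-up four-nucleotide sequences.
print(point_mutation_frequency("ATGC", ["ATGC", "ATGA", "TTGA", "ATGC"]))
```

Distinguishing synonymous from missense changes would additionally require translating each codon, which is omitted from this sketch.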

Figure 5 depicts further aspects of population dynamics of bla sequences during
adaptation to cefotaxime.

(A) Normalized frequency of bla alleles containing a given number of mutations as a
function of passage. White, passages A0 and A1; gray, passages C1 to C4; black, passages
C7 and C10.

(B) Distribution of different mutation types in bla sequences from C1, C2, and C3.

(C) Percent identity plots showing genetic variance in bla populations from different
passages. Plots (solid lines) are cumulative distribution functions of identities between
every pair within n sequences, where the vertical axis represents the fraction of data points
with the value as small or smaller than a given identity value. More heterogeneous
sequence populations give plots more deviated from the 100% identity asymptote (dashed
line). Data for related passages A0 and A1 and also for C7 and C10 were combined to
improve statistics. Plots were created in GeneDoc (http://www.psc.edu/biomed/genedoc/).
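
The cumulative distribution described for panel (C) can be sketched as follows. This Python fragment is an illustration under stated assumptions and does not reproduce the GeneDoc calculation: the function names are hypothetical, sequences are assumed to be aligned and of equal length, and identity is computed as the simple percentage of matching positions.

```python
from itertools import combinations


def percent_identity(seq_a, seq_b):
    """Percentage of positions at which two aligned, equal-length sequences match."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


def pairwise_identities(sequences):
    """Identity values for every unordered pair within the n sequences."""
    return [percent_identity(a, b) for a, b in combinations(sequences, 2)]


def empirical_cdf(values):
    """List of (value, fraction of data points <= value) for each distinct value."""
    ordered = sorted(values)
    n = len(ordered)
    cdf = []
    for rank, value in enumerate(ordered, start=1):
        if cdf and cdf[-1][0] == value:
            cdf[-1] = (value, rank / n)   # keep the highest rank for tied values
        else:
            cdf.append((value, rank / n))
    return cdf


# Toy population of made-up sequences.
population = ["AAAA", "AAAT", "AATT"]
for identity, fraction in empirical_cdf(pairwise_identities(population)):
    print(f"identity <= {identity:.1f}%: fraction {fraction:.2f}")
```

A more heterogeneous population places more pairs at low identity values, so its curve rises earlier and lies further from the 100% identity line.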

DETAILED DESCRIPTION OF THE INVENTION 

1. Definitions 

Unless explicitly stated otherwise, specific terms used throughout this invention have the 
following meanings: 

The term "bacteriophage" refers to a virus infecting eubacteria or another prokaryotic
organism, such as e.g. archaea.

The term "biological activity", as used herein, refers broadly to various functions and
properties of a protein or nucleic acid. Examples of biological activities include but are not
limited to catalytic, binding, and regulatory functions.



As used herein, the term "biological entity", refers to all systems containing nucleic acids 
capable of multiplication through a template-directed mechanism.

As used herein, the term "carrier-state cells" refers to a cell line or plurality of cells 
infected by a virus, which can support multiple rounds of the virus genome replication,
remaining in a living state for a period of time substantially longer than a typical duration
of the virus life cycle.

As used herein, the term "directed evolution", or sometimes "directed molecular 
evolution", refers to a process of intentionally changing properties of proteins or nucleic 
acids using an algorithm that comprises one or several rounds of successive
diversification and selection steps. This algorithm is ascribed to natural evolution by
Darwin's theory.

The term "DNA-dependent polymerase" refers to nucleic acid polymerase capable of 
copying DNA templates. Two types of DNA-dependent polymerases are known, producing 
DNA or RNA copies of DNA templates. These are referred to as DNA-dependent DNA 
polymerases and DNA-dependent RNA polymerases, respectively. Also see "polymerase".

The term "erroneous nature" is used here in reference to template-dependent nucleic acid 
polymerases lacking proofreading function or when describing the process catalyzed by
such polymerases. 

The term "nucleic acid sequence", or sometimes "nucleotide sequence", refers to an order
of nucleotides in an oligonucleotide or polynucleotide chain.

The term "polymerase", or sometimes "nucleic acid polymerase", refers to a protein or a 
protein complex that can catalyze the polymerization of ribo- or deoxyribo-nucleoside 
triphosphates into a polynucleotide chain. 

The term "protein sequence", or sometimes "amino acid sequence", refers to an order of
amino acid residues in a peptide or protein chain.

The term "proofreading", as used herein, refers to the capacity of certain polymerases to
remove nucleotides incorrectly incorporated into a growing nucleic acid chain, thus
increasing fidelity of the template copying process. In template-directed synthesis,
nucleotide incorporation into a nucleic acid chain is considered incorrect if it violates the base
complementarity rules of Watson and Crick. Polymerases of the present invention are
characterized by the lack or deficiency of the proofreading activity, which enhances the
mutation rate and generates sequence diversity in the target population.

As used herein, the term "ribovirus" refers to an RNA virus whose life cycle proceeds 
entirely on the level of RNA and does not normally include a DNA phase. Riboviruses
include viruses with positive- and negative-sense single-stranded (ss) RNA genomes as
well as double-stranded (ds) RNA viruses. A preferred embodiment of this invention deals
with dsRNA viruses from the Cystoviridae family, also referred to as "cystoviruses". Also
see "RNA virus".

As used herein, the term "reverse-transcribing virus" refers broadly to a virus whose life
cycle necessarily includes both RNA and DNA phases. The name of the group derives
from the process of "reverse transcription" used by these viruses wherein RNA molecules
are used as templates to produce DNA copies. Two types of reverse-transcribing viruses
are known, "retroviruses" and "pararetroviruses". Retroviruses encapsidate their genomes
in the form of RNA but use DNA intermediates when multiplying in infected cells.
Pararetroviruses encapsidate DNA genomes but use RNA intermediates when multiplying
in infected cells.

The term "ribozyme" refers to an RNA molecule with detectable catalytic activity. Various 
natural and artificial ribozymes possessing diverse catalytic activities have been described 
in the previous art (Bittker et al., 2002b; Doudna and Cech, 2002; Jaschke, 2001).

The term "RNA virus" refers to viruses having RNA genomes.

As used herein, the term "RNA-based autonomous genetic element" refers generically to 
biological entities containing an RNA genome but distinct from RNA viruses. RNA-based
autonomous genetic elements include but are not limited to RNA virus-like particles,
viroids, and RNA plasmids. Another term sometimes used in the literature to refer to
RNA-based autonomous genetic elements is "RNA subviral agent". Also see definition of
"biological entity".

The term "RNA-based organism", as used herein, refers generically to RNA viruses and
RNA-based autonomous genetic elements defined above. Because all RNA organisms are
capable of replicating their genomes under appropriate conditions, the term "RNA
replicon" is used herein in reference to RNA organisms and derivatives thereof to
emphasize this capability.

The term "RNA-dependent polymerase" refers to a nucleic acid polymerase capable of
copying RNA templates. Two types of RNA-dependent polymerases are known, producing
RNA or DNA copies of RNA templates. These are referred to as "RNA-dependent RNA
polymerases" ("RdRP") and "RNA-dependent DNA polymerases" ("RdDP", better known
as reverse transcriptases), respectively. Also see "polymerase".

As used herein, the term "screening" refers to procedures wherein variants having preferred
properties are identified and/or picked from a target population manually or using an
automated process.

The term "selection" is used herein in two contexts. In a specific context, "selection" refers
to a procedure wherein different variants of a target population compete with each other so
that only the fittest variants are retrieved, whereas less fit members of the population are lost.
This can be also defined as "bona fide selection". In a more general context, "selection"
refers generically to all procedures (including "screening" and "bona fide selection")
wherein a fraction of variants is withdrawn from a target population for further use.

As used herein, the terms "target" or "target molecule" refer to a protein or nucleic acid 
that is subjected to the methods of this invention, which are designed for changing nucleic
acids and proteins and inducing sequence-specific gene silencing effects. A plurality of target
molecules comprising one or many distinct variants is sometimes referred to as a "target
population". The length of a target nucleic acid can be from about 20 bases, preferably
from about 50 bases to 15 kilobases, more preferably from 300 bases to 3 kilobases.

"Heterologous target sequence" refers here to a target sequence from any possible origin
except from the RNA-based biological entity (e.g. RNA virus), which is used in
the replication of the target sequence.

"Detectable replication" refers here to the replication of the nucleic acid target detectable
by any standardly available molecular biology method.

As used herein, the term "suicide vector" or a more specific term "suicide plasmid" refers
to, respectively, a vector or plasmid that cannot be stably maintained within a given cell line but
can direct transient gene expression.

Other terms are explained in the text or used according to the common practices of the art.

2. Directed evolution

2.1. General considerations

In the first aspect, this invention provides a method for changing nucleic acids and 
proteins. 

Replication of RNA genomes is catalyzed by RNA-dependent polymerases that lack
proofreading function. This makes RNA copying an intrinsically erroneous process. As a
specific example, relevant to preferred embodiments of this invention, the per-nucleotide
mutation rate for dsRNA bacteriophage φ6 has been estimated at ~1x10'^ to 2.7x10"^
depending on the method used (Chao et al., 2002; Drake and Holland, 1999).

In addition to the high mutation rate, many RNA genomes are capable of homologous
and/or non-homologous recombination, which further contributes to the genetic diversity
(Domingo et al., 2001; Miller and Koev, 1998; Negroni and Buc, 2001). Notably, genomes
of dsRNA bacteriophages from the Cystoviridae family have been reported to recombine
with a detectable efficiency (Onodera et al., 1993; Onodera et al., 2001; Qiao et al., 1997;
Qiao et al., 2000).

The quasispecies theory describes populations of RNA replicons as clouds, or swarms, of 
distinct but closely related genotypes (Domingo et al., 1996; Eigen, 1996). Such
organization allows rapid adaptation to new environments, since a number of
potentially advantageous mutations are already present in the population at the onset of
selective pressure.

Therefore, high mutation and recombination rates are a likely reason for the remarkable
evolutionary success of RNA viruses. Many RNA viruses, including HIV and hepatitis C
virus, are known to efficiently escape host immune responses and medical treatment by
promptly accumulating resistant mutants (Domingo et al., 1997; Farci et al., 2000;
Harrigan and Alexander, 1999). With continually emerging new strains and even species
(Fouchier et al., 2003; Marra et al., 2003; Nichol et al., 2000), RNA viruses cause over
75% of all viral diseases and constitute an overwhelming majority of all viral species
(Domingo et al., 2001).

It is within the scope of this invention to utilize the high evolutionary potential of RNA 
replicons for changing properties of target nucleic acids and proteins. The relevant method 
comprises the steps of: 

a) providing input nucleic acid target in a form replicable by a polymerase devoid 
of the proof-reading function; 

b) contacting said replicable form of the nucleic acid target with said polymerase 
under conditions sufficient for template-directed nucleic acid synthesis in a 
living cell; 

c) recovering nucleic acid synthesis products, whose nucleotide sequence differs
from said input target sequence by at least one nucleotide.

It is obvious that the above method can be used in its general form for introducing
advantageous, neutral and/or disadvantageous changes into the nucleic acid sequence of 
interest (nucleic acid target). 
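
As a rough illustration of step c), the chance that a single copy of the target differs from
its template by at least one nucleotide can be estimated from a per-base error rate. The
sketch below assumes that copying errors are independent and equally likely at every
position; the function name and the numbers used (an error rate of 1e-4 per base and a
1000-nt target) are illustrative assumptions, not values taken from this description.

```python
def prob_at_least_one_change(per_base_error_rate: float, target_length: int) -> float:
    """Probability that one copy differs from its template at one or more positions.

    Assumes independent, uniform copying errors (a simplification).
    """
    if not 0.0 <= per_base_error_rate <= 1.0:
        raise ValueError("per_base_error_rate must lie between 0 and 1")
    if target_length < 0:
        raise ValueError("target_length must be non-negative")
    # Probability that every position is copied correctly, subtracted from 1.
    return 1.0 - (1.0 - per_base_error_rate) ** target_length


if __name__ == "__main__":
    # Hypothetical values chosen only to show the calculation.
    p = prob_at_least_one_change(per_base_error_rate=1e-4, target_length=1000)
    print(f"fraction of copies expected to carry a change: {p:.3f}")
```

Under these assumptions, a longer target or a more error-prone polymerase raises the
fraction of altered copies available for recovery.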

However, in the currently preferred variation of the method, said recovering of modified
nucleic acid synthesis products is performed after an appropriate selection/screening 
procedure, so that only advantageous changes are recovered. In this preferred form the 
method is intended for directed molecular evolution. This method variation employs an 
optimization algorithm comprising the steps of:

a) providing input nucleic acid target in a form replicable by a polymerase devoid
of the proof-reading function;

b) contacting said replicable form of the nucleic acid target with said polymerase
under conditions sufficient for template-directed nucleic acid synthesis in a
living cell;

c) selecting or screening nucleic acid synthesis products based on their properties;

d) recovering nucleic acid synthesis products, whose properties are deemed
superior to said input nucleic acid target.

In some embodiments, it will be sufficient to carry out only one round of the above 
optimization algorithm to improve target sequence to a sufficient extent. However, the 
method users will often find it more advantageous to perform two or more rounds of 
optimization. Indeed, the evolution of the TEM beta-lactamase sequence described in the 
Examples was carried out using at least two optimization rounds (passages).
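
The repeated rounds of error-prone replication, selection and recovery can be pictured with
a small in silico toy model. The sketch below is only an abstraction under stated
assumptions: the names `replicate`, `evolve` and `toy_fitness`, the error rate, the
population size and the scoring function are all hypothetical and stand in for the
biological steps; retaining the parental sequences between rounds is a modelling choice
made here so that the recovered variant is never worse than the input.

```python
import random

ALPHABET = "ACGU"


def replicate(seq: str, error_rate: float, rng: random.Random) -> str:
    """Copy seq, substituting each base with probability error_rate (no proofreading)."""
    copy = []
    for base in seq:
        if rng.random() < error_rate:
            copy.append(rng.choice([b for b in ALPHABET if b != base]))
        else:
            copy.append(base)
    return "".join(copy)


def evolve(target, fitness, rounds=2, population=200, error_rate=0.01, keep=10, seed=0):
    """Run several rounds (passages) of replicate -> select -> recover."""
    rng = random.Random(seed)
    templates = [target]
    for _ in range(rounds):
        # Steps a) and b): error-prone copying of the current templates.
        pool = [replicate(rng.choice(templates), error_rate, rng) for _ in range(population)]
        # Step c): selection/screening, modelled here as ranking by a score.
        pool.sort(key=fitness, reverse=True)
        # Step d): recover the best variants; parents are retained (assumption).
        candidates = list(dict.fromkeys(pool[:keep] + templates))
        templates = sorted(candidates, key=fitness, reverse=True)[:keep]
    return templates[0]


# Hypothetical scoring function: number of positions matching an arbitrary optimum.
OPTIMUM = "GGGGGGGGGGGG"


def toy_fitness(seq: str) -> int:
    return sum(a == b for a, b in zip(seq, OPTIMUM))


if __name__ == "__main__":
    start = "AAAAAAAAAAAA"
    best = evolve(start, toy_fitness, rounds=5, error_rate=0.05)
    print(start, toy_fitness(start), "->", best, toy_fitness(best))
```

In this toy model, increasing `rounds` plays the role of additional passages.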

An important aspect of the method described above is the nature of the target. The strategy 
used by the method dictates the physical nature of the target to be a nucleic acid, preferably 
RNA, a usual template for polymerases lacking proofreading function. However, many
nucleotide sequences can be translated into amino acid sequences, which makes the present 
invention broadly related to changing/improving both nucleic acids and proteins. 

2.2. Preferred formats 

Specific embodiments of the above-described method for changing nucleic acids and
proteins, as well as the currently preferred method for directed evolution, may differ with
respect to the formats used.

Viral RNA vectors 

It is essential for the changing/evolving procedure that the nucleic acid target is provided in 
a form replicable by a polymerase devoid of the proofreading function. In most 
embodiments, this step is realized through linking the target with determinants required for 
detectable replication by said polymerases.

In the selected formats, the target is integrated within RNA replicons, thus allowing replication
of the target by an appropriate RNA-dependent polymerase. It may be advantageous for
many applications to choose RNA viruses as RNA replicons. In this case, the integrated target
is replicated as a part of the viral genome by the virus-encoded polymerase, preferably an
RNA-dependent polymerase. Previous experiments in which RNA viruses were used as vectors for
heterologous sequence inserts demonstrate the feasibility of this approach. For example,
alphaviruses, retroviruses and some (-)RNA viruses are used as vectors for gene therapy
and gene expression applications (Palese, 1998; Robbins et al., 1998). Similarly, several
RNA viruses infecting plants may also be used as vectors (Lindbo et al., 2001).

Although some embodiments of the method can rely on single-stranded RNA viruses, it
may be advantageous for many applications to select viruses that have a double-stranded
RNA genome. dsRNA resists nuclease degradation better than ssRNA, which makes it
easier to purify a sufficient amount of intact dsRNA than of ssRNA. Examples of
dsRNA viruses include members of the Cystoviridae, Reoviridae, Totiviridae,
Partitiviridae, Birnaviridae and Hypoviridae families. For reasons of economy and
convenience, it may be advantageous to use viruses from the Toti- and
Partitiviridae families, which infect prokaryotes and lower eukaryotic organisms such as
yeast and other fungi. Bacteriophage φ6 and its relatives (φ7 through φ14), infecting
gram-negative bacteria, and the Saccharomyces cerevisiae viruses L-A and L-BC, which have
also been known under the name of "virus-like particles", are amongst the most obvious choices.

In the currently preferred embodiment, the target gene is integrated within the genomic RNA
of a dsRNA bacteriophage from the Cystoviridae family (a cystovirus). An important
advantage of an RNA bacteriophage over animal or plant RNA viruses is the low cost and
relative ease of propagation. Furthermore, bacteriophages generally have shorter life
cycles, which helps to reduce the time of the experiment.

As a specific example of the dsRNA bacteriophage format, the target gene can be integrated
into the M segment of the cystovirus φ6 and replicated by the φ6-encoded RNA-dependent
RNA polymerase. In further embodiments, other members of the Cystoviridae family, from
φ7 through φ14 (Mindich et al., 1999), can be used as vectors for target sequences and also
as a polymerase source. Furthermore, any of the three genomic segments L, M and S, typical
for the Cystoviridae, can be used for integrating the target sequence.

Furthermore, it is known that at least some cystoviruses can tolerate substantial genome
rearrangements, which can be manifested in the form of shortened or extended genomic
segments, or a change in the segment number. For example, variants of φ6 containing 1, 2
or 4 genomic segments have been described (Onodera et al., 1995; Onodera et al., 1998).
These modified cystoviruses are also within the scope of this invention, as they can be
more advantageous RNA vectors than the wild-type cystoviruses.

It has been shown that the synthesis of cystoviral RNA is catalyzed by the so-called
polymerase complex, which includes proteins P1, P2 (catalytic subunit), P4, and P7 (Mindich,
1999a; Mindich, 1999b). The polymerase complex also serves as a container for genomic
RNA. All polymerase complex proteins are encoded on the L segment. Earlier studies have
also demonstrated that bacterial cells expressing cDNA of the L segment accumulate
functional polymerase complex particles (Mindich, 1999b). Therefore, some embodiments
may involve the use of cystovirus derivatives whose L segment encodes the polymerase
complex, whereas additional segment(s) are used for incorporating nucleic acid targets. In
alternative embodiments, proteins of the polymerase complex can be produced from
cDNA, which can be introduced into the bacterial cell, for example in the form of a DNA
plasmid. In this case, the entire genetic capacity of the polymerase complex (~15 kb) can
be used by RNA segment(s) encoding the evolution target(s).

It is a currently preferred feature that the RNA virus vector used is propagated in the form of
carrier-state cells. This type of viral infection does not destroy most of the infected cells,
thus effectively extending the time of target gene expression. Clearly, all formats where the
virus is not lethal for the infected cell will be particularly useful for protein evolution
projects.

In the currently preferred embodiment, recombinant bacteriophage φ6 is propagated within
carrier-state bacteria Pseudomonas syringae. Because at least some of the related
cystoviruses have been shown to infect Escherichia coli and Salmonella typhimurium
(Hoogstraten et al., 2000; Mindich et al., 1999; Qiao et al., 2000), additional embodiments
of this invention will be based on the use of carrier-state gram-negative bacteria containing
a recombinant cystovirus selected from the group of φ6, φ7, φ8, φ9, φ10, φ11, φ12, φ13,
and φ14.

In further specific embodiments, non-lethal infection can be achieved by using special cell
lines, weakened (attenuated) virus strains, or both. As an example of the first strategy,
mutants of P. syringae cells are known that form carrier-state cells after being infected with
the wild-type φ6 virus. Attenuated viruses can be selected as naturally occurring mutants or
engineered artificially. In some cases it will be sufficient to substitute a part of the viral genes
with the target sequence to obtain an attenuated virus. Interestingly, non-lethal infection is
typical for the normal life cycles of several viruses. Examples include the above-mentioned
yeast totiviruses L-A and L-BC.

Non-viral RNA vectors

Although the use of virus-based vectors is advantageous for many applications, some
embodiments of the directed evolution method may use non-viral vectors. One example of
this strategy is to use specific elements that are replicated in nature by viral RNA-dependent
RNA polymerases, such as diverse defective interfering (DI) elements and
satellite RNAs. Specific examples include small RNAs multiplied by the RdRP of the
coliphage Qβ and toxin-encoding satellites of the yeast L-A virus (M1, M2, and others)
(Brown and Gold, 1995; Wickner, 1996).

Another example of non-viral vectors would be the use of autonomous genetic elements
found, for example, in fungi and plants. S. cerevisiae strains often contain single-stranded
replicons called 20S RNA and 23S RNA. Of these, 20S RNA is an apparently naked RNA
replicon (with a dsRNA form called W) encoding an RNA polymerase. 23S RNA also
encodes an RNA polymerase and has a dsRNA form called T (Wickner, 1996).
Furthermore, some plants, such as rice, are infected by extensive dsRNA elements, referred
to as "RNA plasmids" or "endornaviruses" by different authors (Gibbs et al., 2000). These
elements encode their own RdRP and seem to lack coat proteins. Many RNA replicons of
non-viral origin normally do not destroy the infected cell, which can be an
advantageous feature as discussed above.

Polymerase sources 

In the aforementioned embodiments, target nucleic acid, integrated into viral or non-viral 
RNA vector, is replicated by an RNA-dependent polymerase. It will be obvious for those 
skilled in the art that said polymerase can be provided in any number of ways. In some
embodiments, the polymerase will be encoded by the RNA replicon containing the nucleic
acid, whereas in other embodiments the polymerase will be encoded by another RNA
replicon co-infecting the host cell.

In yet further embodiments, the polymerase can be encoded by DNA, which can be of
chromosomal, plasmid, viral, transposon or other origin. An example of this format was
discussed above for cystovirus-based vectors. In another specific embodiment, the target
sequence can be incorporated into viroid RNA, and the replication of the genetically altered
viroid RNA is probably carried out by cellular RNA polymerase II, operating in this case
in the RNA-dependent mode (Lai, 1995). In other embodiments, viral polymerase genes
can be introduced in a DNA form into the host cell and expressed using the cellular
transcription and translation apparatus.

Delivery methods 

Another important aspect of the methods for changing/evolving biological molecules is the
procedure used for bringing nucleic acid targets in contact with the polymerase lacking
proofreading function.

In a specific embodiment of this invention, this task can be accomplished by contacting a
replicable form of the nucleic acid target with said polymerase within a living cell. For this
purpose, both the target and the polymerase have to be delivered into the host cell.

Different delivery methods can be used in different embodiments, ranging from delivery
through virus infection, transformation (in bacteria), transfection (in eukaryotic cell lines),
electroporation, lipofection, ballistic methods, agroinfiltration, microinjection etc.
Description of these and other delivery methods can be found elsewhere.

In the currently preferred embodiment, illustrated in Example 1, bacteriophage φ6
RdRP is delivered into the host P. syringae cell using virus infection. The heterologous
sequence is delivered either through virus infection (as in the φ6-npt case) or in the form of
a suicide DNA plasmid using electroporation (as in the φ6-bla case).

In many embodiments, it may be advantageous to deliver RNA replicons containing
marker genes. Such marker genes can be very useful to distinguish between cells that
contain the RNA replicon and the rest of the cells. Indeed, currently available delivery
methods may not be 100% efficient, in that only a fraction of the treated cells usually
receive the RNA replicon encoding the nucleic acid target. Examples of marker genes may
include antibiotic or toxin resistance genes, genes encoding enzymes of amino acid or
nucleotide metabolism, or genes encoding fluorescent proteins. Although in some
embodiments the marker gene can be equivalent to the evolution target, other embodiments
may use marker genes that are distinct from the evolution targets. In the latter case, it is
advantageous to ensure a physical linkage between said marker and target. In a preferred
embodiment, said linkage is achieved through encoding both marker and target on a single
RNA segment.

2.3. Preferred applications

The directed evolution methods of this invention can be preferably used to modify various 
properties of nucleic acids and proteins, as explained below. 

Evolving enzymes 

In a specific embodiment, a gene encoding an antibiotic-degrading enzyme (ampicillin-specific
β-lactamase) is inserted into an RNA virus genome. After an appropriate selection
procedure, a gene having a modified sequence is recovered that encodes an enzyme having
altered antibiotic specificity (it hydrolyzes cefotaxime in addition to ampicillin). The
modified antibiotic resistance genes can be useful as markers or reporters. Alternatively,
this RNA-replicon based evolution procedure can be used to assess the probability of
developing resistance to new antibiotics in pathogenic bacterial strains, as
explained earlier (Orencia et al., 2001).

In additional embodiments, the directed evolution method can be generally used to create
new catalysts, including diverse protein enzymes and ribozymes, or improve already
existing ones. Several parameters can be subjected to the directed evolution process, including
the use of modified substrates, substrate affinity and turnover, pH, ionic strength, or
temperature optima, enzyme behavior with respect to inhibitors and activators, and so on.

In a specific embodiment where RNA catalysts (ribozymes) are targets for directed
evolution, these are physically incorporated into the RNA replicon, thus providing a link
between genotype and phenotype. On the other hand, in other embodiments, designed for
evolving protein catalysts, RNA replicons encode target proteins. In this latter case, the
link between genotype and phenotype is provided by virtue of co-occurrence of RNA
replicons and the cognate protein products within the same cell. Thus, by selecting a cell
expressing improved enzymatic activity one will also select the gene encoding the
improved enzyme.

An obvious requirement imposed on the directed evolution method of this invention is the 
need for a selection or screening procedure, which is essential to recover improved variants
after the sequence diversification step. A number of examples where such 
selection/screening procedure was possible have been discussed elsewhere. 

It may be advisable to devise a selection procedure if the enzyme can substantially
contribute to the cell metabolism. Examples of this type include enzymes of amino acid,
nucleotide and co-enzyme metabolic pathways, as well as hydrolases of different
biopolymers. In some embodiments, it may be advantageous to perform selection for such
activities using auxotrophic or otherwise deficient host cells. Furthermore, enzymes
essential for cell survival under specific conditions, such as those inactivating toxins, heavy
metals or cell growth inhibitors, should be evolved via an appropriate selection procedure rather
than screening.

On the other hand, enzymes that can be detected by a color or fluorescent assay will
perhaps be easier to evolve using manual or automated screening, e.g. by using different
detection units together with image recognition algorithms or alternatively by cell sorting
methods such as fluorescence-activated cell sorting (FACS).

While the currently preferred embodiments of this invention deal with single enzymes,
other embodiments may be focused on a simultaneous evolution of a group of enzymes
catalyzing several reactions, e.g. interdependent reactions constituting a metabolic
pathway or a part thereof. (Indeed, directed evolution methods have been successfully
applied to metabolic engineering; see Zhao et al., 2002, and references therein.) In this
case different genes can be encoded by a single RNA replicon or alternatively provided as
several co-existing RNA replicons. In the specific embodiment where multiple enzymes
are evolved using the φ6 system, it may be advantageous to use the entire coding capacity of at
least M, preferably both M and S, most preferably all three genome segments, L, M and S.

Evolving regulatory molecules

A specific embodiment of the above methods can be used for evolving regulatory
molecules. As in the case concerning enzyme evolution, the method can be directed to
either engineering novel regulatory activities or improving existing ones. In some cases,
regulatory molecules can be proteins or RNAs that activate or inhibit enzymatic activities
through direct interaction with the enzyme. Examples of this class of molecules include
e.g. different RNase and polymerase inhibitors (Jeruzalmi and Steitz, 1998; Pasloske,
2001).

In other cases, regulatory proteins or RNAs can modulate gene expression, exerting
activation or inhibition effects on the transcription, translation, or other levels of gene
expression. This class of regulators includes different activators and repressors that interact
with regulatory regions, such as gene promoters and terminators, as well as mRNA
untranslated regions. Examples of regulatory proteins include catabolite activator protein
(CAP), Lac repressor (LacI), bacteriophage lambda repressors CI and Cro, eukaryotic
transcription factors such as GAL4, mRNA cap- and iron-responsive element binding
proteins, and many others. In addition, many regulators interact with basal factors involved
in transcription or translation as discussed previously (Lemon and Tjian, 2000; Sachs and
Buratowski, 1997). At the RNA level, examples of regulatory elements include translation
enhancers, such as internal ribosome entry sites (IRES) and diverse stem-loop/tRNA-like/pseudoknot
structures found in RNA viruses (Gallie and Walbot, 1990; Leathers et al.,
1993; Olsthoorn et al., 1999; Sachs, 2000; Vagner et al., 2001; Zeenko et al., 2002).
Further examples include regulatory elements controlling mRNA stability and efficiency of
translation both in cis (e.g. iron-responsive elements (IRE) (Theil, 1993)) and in trans (e.g.
recently discovered small regulatory RNAs, also known under the names of miRNAs and
stRNAs (Grosshans and Slack, 2002)).

Regardless of the regulation level, a preferred protocol for evolving regulatory molecules
involves selection or screening for enzymatic (or other) activity that is affected by the
regulator. If the evolution target is an activator, cells showing the highest enzymatic (or
other) activity are selected. In contrast, cells showing the lowest activity are selected when
it is necessary to improve an inhibitor.

Evolving molecules with specific binding activities

In further embodiments, the evolution method of this invention can be used to develop or
modify specific binding activities of proteins or RNAs. As in the case with enzymatic and
regulatory activities, evolution of RNA molecules having specific binding properties will
require that the binding molecule is a physical part of a larger RNA replicon. And again,
proteins with specific binding activities are produced from genes encoded by RNA
replicons.

Selection for binding activities may require special experimental formats, involving
displaying binding molecules for binding with immobilized or immobilizable ligands. In a
specific embodiment, a protein having specific binding activity is displayed on the surface of
the cell containing the RNA replicon, which encodes the binding protein. Cells expressing the
desired variant of the protein can be separated from the pool of cells expressing other
variants of the protein, or expressing no protein at all, using an affinity selection procedure.

In an alternative embodiment, proteins having an affinity to a given ligand are displayed on
a virus particle. The virus particle encloses the RNA replicon encoding the protein
displayed, thus providing a genotype-phenotype link. Notably, the virus may or may not be
the source of the polymerase activity required for the (erroneous) propagation of the RNA
replicon within the host cell. In any case, the virus particles bearing the specific binder on the
surface are selected from the pool of irrelevant virus particles using affinity purification
based on the interaction with the ligand.

In other embodiments, more specific strategies of selection can be used, depending on the 
nature of the binding molecule. For example, if the binding molecule is a part of a signal
transduction pathway (such as cellular receptors or receptor-binding proteins), screening or 
selection for a specific cellular response triggered by the pathway can be used for evolving 
the binding activity. 

Evolving molecules with other activities 

In yet further specific embodiments, other biological activities can be improved using the
evolution method of this invention. As an example, the procedure can be applied to the
green fluorescent protein (GFP) originating from a jellyfish (van Roessel and Brand,
2002). Wild-type GFP is excited by blue light and emits in the green part of the spectrum. A
number of GFP mutants with different spectral characteristics have been created using
different diversification and screening/selection procedures. Some of the modified GFP
variants are used as markers in cell biology and related fields. Using the evolution strategy
of this invention, the GFP gene can be propagated in a specific embodiment within an
appropriate RNA replicon. Some of the appearing GFP mutants can differ from the wt
protein in their excitation and/or emission spectra. The cells producing altered GFP (and
therefore containing RNA replicons with the mutant GFP gene) can be detected either by
eye or using an automated procedure such as FACS.

The above procedure may be used in other embodiments for evolving other fluorescent and
pigment-binding proteins, as well as certain enzymes generating colored or fluorescent 
products and/or using colored or fluorescent substrates. 

Other utilities 

In additional embodiments, the directed evolution method can be employed for specific
uses such as improving RNA stability, translation efficiency or codon usage. In this case a
target RNA molecule encoding a detectable biological activity is integrated into an RNA
replicon and the expression level of the encoded product is scored using an appropriate
detection method. Some mutations generated during the propagation of the RNA replicon
can increase the expression level of the product without affecting its biological activity.
It is expected that such mutations can include changes increasing RNA stability
against nuclease degradation, changes increasing translation efficiency, and changes of rare
codons to more commonly used ones.
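
One simple way to follow the last of these effects in recovered sequences is to tally codon
usage before and after propagation. The sketch below is a hedged illustration only: the
function names are hypothetical, and the set of "rare" codons passed in is an assumption
that would have to be replaced by a codon usage table appropriate to the actual host.

```python
from collections import Counter


def codon_counts(orf: str) -> Counter:
    """Count in-frame codons of an open reading frame (trailing partial codon ignored)."""
    usable = len(orf) - len(orf) % 3
    return Counter(orf[i:i + 3] for i in range(0, usable, 3))


def rare_codon_fraction(orf: str, rare_codons: set) -> float:
    """Fraction of codons in orf that belong to the supplied set of rare codons."""
    counts = codon_counts(orf)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return sum(counts[c] for c in rare_codons) / total


if __name__ == "__main__":
    # Hypothetical sequences and rare-codon set, used only to show the calculation.
    rare = {"AGG", "AGA"}
    before = "AUGAGGAGGUAA"
    after = "AUGCGUAGGUAA"
    print(rare_codon_fraction(before, rare), rare_codon_fraction(after, rare))
```

A drop in this fraction between the input target and a recovered variant would be
consistent with replacement of rare codons, although it does not by itself show that
expression has improved.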

3. Gene silencing 

In the second aspect, this invention provides a method for inducing sequence-specific gene 
silencing effects, such as RNAi, wherein RNA replicons are utilized as vehicles for mass
production of heterologous sequences in the dsRNA form in vivo. 

This method comprises the steps of:

a) providing nucleic acid target in a form replicable by an RNA-dependent RNA
polymerase in a living cell;

b) contacting said replicable form of the nucleic acid target with said polymerase
under conditions sufficient for template-directed RNA synthesis, one of the
reaction products being necessarily double-stranded (ds) RNA;

c) recovering said dsRNA products in a sufficiently pure form and optionally
modifying said products for optimal performance;

d) using said pure, optionally modified, dsRNA products to induce sequence-specific
gene-silencing effects in eukaryotic systems, such as organisms, cells or
cell-free extracts.

Two major requirements affect the choice of preferred embodiments.

(1) It is advantageous to produce large amounts of sufficiently pure dsRNA molecules
without substantial expense.

(2) It is also advantageous to perform all the method steps within the shortest time possible.

Accordingly, the currently preferred embodiments of the method utilize recombinant
dsRNA viruses infecting prokaryotic and lower eukaryotic organisms, such as
Cystoviridae, Totiviridae and Partitiviridae. The hosts of these viruses, usually bacteria and
fungi, can be propagated easily and inexpensively, thus enabling a mass production of
dsRNA from the corresponding recombinant virus. In the most preferred embodiment,
dsRNA viruses from the Cystoviridae family are used as vectors for propagating
heterologous sequences in the dsRNA form. Other embodiments can certainly make use of
other viruses, both of dsRNA and ssRNA nature. The use of ssRNA viruses is theoretically
justified since many of these viruses form dsRNA replication intermediates.

It is furthermore preferred that the target sequence to be converted and further propagated
in the form of dsRNA is delivered into the host cell in the form of a DNA vector under the
control of an appropriate DNA-dependent RNA polymerase promoter. The transcription
product derived from said DNA vector must comprise the nucleic acid target and the
sequences sufficient for RNA replication. The host cell must contain an RNA-dependent RNA
polymerase that can replicate the target RNA molecule.

In the currently preferred embodiment, the target sequence is delivered into P. syringae
carrier-state cells carrying φ6 virus, in the form of a suicide DNA plasmid that cannot be stably
propagated in Pseudomonas but can be transiently transcribed by the cellular RNA
polymerase. The target is physically linked with a marker gene, such as an ampicillin or
cefotaxime resistance gene, and therefore need not encode any detectable activity. The
translation of the target sequence into protein is also optional. The cells that acquired the
target molecule in the form replicable by the φ6 polymerase complex will express the marker
gene and will be distinguishable from the rest of the cells (e.g. will be
ampicillin/cefotaxime resistant).

Further specific embodiments of this invention are based on the use of other recombinant
cystoviruses (φ7 through φ14) propagated within carrier-state Pseudomonas sp. or other
gram-negative bacteria, such as Escherichia coli or Salmonella typhimurium.

Because it is advantageous that the target sequence is not changed substantially when 
propagated in the form of dsRNA, in the preferred embodiments the time of RNA replicon
propagation is kept to a minimum. In the most preferred embodiment, the RNA replicon is
propagated within an appropriate cell line for 12-96 hours, preferably 24-48 hours.

In the currently selected embodiment, dsRNA is recovered from the carrier-state cells using
a specific phenol/chloroform extraction and precipitation procedure described in
Example 3. The dsRNA preparation thus obtained may contain ribosomal RNA, tRNA, traces
of the bacterial chromosome and proteins. It may therefore be advisable for dsRNA quality
sensitive applications to amend this purification procedure with steps removing dsDNA,
ssRNA and protein impurities. These steps may include but are not limited to purification
using anion exchange chromatography, adsorption chromatography on cellulose or silica
resins, gel-filtration, as well as DNase, protease or ssRNA-specific RNase treatments. In
an alternative embodiment, dsRNA can be purified from isolated virus particles, which can
also reduce the amount of impurities.

In specific embodiments intended for inducing sequence-specific gene silencing in
invertebrate animals, fungi, protozoa and plants, extensive dsRNA triggers purified as
described above can be used as such. However, in vertebrate animals, long dsRNA may
induce a number of unspecific effects, whereas 19-22 nt long dsRNA fragments induce
sequence-specific silencing only (McManus and Sharp, 2002). It may therefore be
advantageous for embodiments which involve inducing RNAi in vertebrates or vertebrate
cell lines to fragment long dsRNAs into 19-22 nt pieces. Several fragmentation methods
have been described elsewhere, including hydrolysis by the ribonucleases DICER and
RNase III (Myers et al., 2003; Yang et al., 2002).
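
For planning purposes, the set of short fragments that a long dsRNA could in principle yield
can be listed in silico. The sketch below simply enumerates every window of a chosen length
along one strand; it does not model the cleavage preferences of any ribonuclease, and the
function name, the default length of 21 nt and the example sequence are assumptions made
for illustration.

```python
def candidate_fragments(long_rna: str, length: int = 21, step: int = 1) -> list:
    """List all windows of `length` nt along one strand of a long dsRNA sequence.

    Only lengths in the 19-22 nt range discussed in the text are accepted.
    """
    if not 19 <= length <= 22:
        raise ValueError("length must be between 19 and 22 nt")
    if step < 1:
        raise ValueError("step must be a positive integer")
    last_start = len(long_rna) - length
    # range() is empty when the sequence is shorter than the requested length.
    return [long_rna[i:i + length] for i in range(0, last_start + 1, step)]


if __name__ == "__main__":
    # Hypothetical 30-nt sequence used only to show the windowing.
    example = "AUGAGUAUUCAACAUUUCCGUGUCGCCCUU"
    for fragment in candidate_fragments(example, length=21, step=3):
        print(fragment)
```

Setting `step` equal to `length` gives non-overlapping pieces, while `step=1` lists every
possible fragment of that size.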

4. Kits for changing nucleic acid or protein sequences and kits for mass production of
dsRNA 

One further object of this invention is a kit for changing nucleic acid or protein sequences. 
The kit comprises one or more, preferably at least two of the following items: 

a) a vector for transient expression of target nucleic acid in preselected cells that
either are carrier-state or can be transformed into carrier state; and/or

b) a genetically modified virus into which the target nucleic acid can be introduced;
and/or

c) cells that either are carrier-state or can be transformed into carrier state.

Another object of this invention is a kit for mass production of dsRNA. The kit comprises 
one or more, preferably at least two of the following items: 

a) a vector for transient expression of target nucleic acid in preselected cells that
either are carrier-state or can be transformed into carrier state; and/or

b) a genetically modified virus into which the target nucleic acid can be introduced;
and/or

c) cells that either are carrier-state or can be transformed into carrier state.

The vector is preferably a suicide vector. 

The following Examples provide further illustrations of various aspects and embodiments
of the present invention. A skilled artisan will appreciate that specific details can be
modified without departing from the scope of the invention.

EXAMPLES 

Example 1. Introducing heterologous sequences into the genome of dsRNA virus φ6
and creating carrier-state host bacteria

1.1. Bacterial strains and plasmids

Escherichia coli DH5α was used as a host for plasmid propagation and gene engineering.
Plasmid pEM35 was produced by inserting the neomycin phosphotransferase (npt) cassette
from pUC4K (Pharmacia) at the PstI site of pLM656 (Olkkonen et al., 1990). The correct
plasmid encoding the φ6 M segment with the inserted npt gene in the sense orientation was
selected using restriction analysis. To construct pEM37, the TfiI-XbaI fragment, containing
the φ6 M segment, was excised from pLM656, the ends were filled in using the Klenow
fragment of DNA polymerase I, and the blunt fragment was inserted into the pSU18 vector
(chloramphenicol resistance marker; Bartolome et al., 1991) at HinSmrXbal sites. To
produce pEM38, the β-lactamase (bla) gene was amplified from pUC18 using the primers
5'-TTCACTGCAGATGCATAAGGAAGCATATGAGTATTCAACATTTCCGT-3' and
5'-CAAACTGCAGAAGCTTACCAATGCTTAATCAGTGAG<k^ and Pfu DNA
polymerase (Stratagene). The resulting PCR fragment was inserted at the PstI site of
pEM37 in the sense orientation.

1.2. Constructing φ6-npt carrier-state cells

The infection of Pseudomonas syringae HB10Y with the wild-type φ6 culminates in cell
lysis and release of viral progeny (Mindich, 1988). However, when the kanamycin
resistance marker npt was inserted into the φ6 M segment, it was possible to select carrier-state
bacteria on Km-containing medium (Onodera et al., 1992).

We repeated this experiment to obtain a Km-resistant strain HB10Y(φ6-npt). Briefly,
purified recombinant φ6 procapsids (PCs) were packaged in vitro with recombinant m+
(single-stranded sense copy of φ6 M segment) containing the npt gene (T7 transcript from
pEM35 treated with XbaI and mung bean nuclease) and the wild-type l+ and s+
(single-stranded sense copies of L and S). The packaged ssRNAs were converted into dsRNAs
using PC replication in vitro and the particles were coated with φ6 P8 protein to produce
infectious nucleocapsids (Bamford et al., 1995). These were used to produce recombinant
virus plaques on a P. syringae HB10Y lawn. Material from one of the plaques (clone #26)
was streaked onto LB agar plates containing 30 µg/ml kanamycin (Km) to select
carrier-state bacteria HB10Y(φ6-npt) bearing the recombinant virus. These could be stably
propagated on Km-containing LB agar or in LB medium without losing the npt gene, as
judged by agarose gel electrophoresis of viral dsRNA and RT-PCR with npt-specific
primers 5'-CAAGGAATTCCATGGGCCATATTCAACGGGAAA-3' and
5'-CCAGGATCCTTTAAAAAAACTCATCGAGCATCAAATGAAACT-3'.

As expected, dsRNA segment M of the virus (M-npt) was longer than wild-type M,
whereas φ6-npt L and S segments had regular lengths (Fig. 2A, lanes φ6 and K).

1.3. Constructing φ6-bla carrier-state cells

Constructing φ6-npt involved manipulations with purified RNAs and viral procapsids
(PCs) in vitro, followed by spheroplast infection (Bamford et al., 1995). To avoid these
technical difficulties when preparing φ6-bla virus, we used a plasmid-based strategy (Fig.
1) first developed by Mindich and colleagues (Mindich, 1999b). HB10Y(φ6-npt) cells were
transformed with plasmid pEM38 that encodes the φ6 M segment containing the ampicillin
resistance marker bla.

For the transformation, electrocompetent HB10Y(φ6-npt) cells were prepared as described
(Lyra et al., 1991). These (40 µl) were electroporated with 0.1 mg/ml pEM38. The cell
suspension was diluted with 1 ml of LB containing 1 mM MgSO4, incubated at 28°C for 2
h, and plated onto LB agar containing 150 µg/ml ampicillin.

pEM38 cannot replicate in P. syringae, but it can direct transient expression of the
recombinant M segment, as previously shown for other E. coli plasmids (Mindich, 1999b).
Some of the RNA transcripts can be packaged by PCs, present in the HB10Y(φ6-npt)
cytoplasm, giving rise to virus. Indeed, Amp-resistant colonies (10* to 10^ per µg of
DNA) appeared after 48-72 h of incubation at 28°C on pEM38- but not on
mock-transformed plates. One of the Amp-resistant clones, which could be stably propagated in
the presence of Amp, was used for subsequent experiments. Electrophoretic analysis of the
φ6-bla dsRNA genomic segments revealed the presence of two M segment species, M-npt
and a new segment, M-bla, migrating between M-npt and wt M (Fig. 2A, lane A0).

1.4. Carrier-state bacteria contain RNA-encoded antibiotic resistance genes
We carried out RT-PCR analysis to ensure that the bla gene was indeed encoded by φ6
rather than by host DNA. The bla PCR product was readily detectable when nucleic acid
extracted from HB10Y(φ6-bla) was reverse-transcribed and amplified using bla-specific
primers (Fig. 2B, lane 6). However, no product appeared in the control when the RT step
was performed without reverse transcriptase (lane 5). This strongly suggests the RNA
nature of the bla gene. Using npt-specific primers, we also observed that HB10Y(φ6-bla)
bacteria retain detectable amounts of the npt gene (lane 4), consistent with the
electrophoretic analysis of HB10Y(φ6-bla) RNA. As expected, HB10Y(φ6-npt) cells
contained only an RNA-encoded npt gene (lanes 1-3).
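
The logic of the no-RT control can be summarized in a short Python sketch. This is only an illustration of the reasoning used above; the function name and the returned labels are hypothetical and are not part of the described procedure.

```python
def infer_template(product_with_rt: bool, product_without_rt: bool) -> str:
    """Interpret an RT-PCR result together with its no-RT control.

    Hypothetical helper: the labels below are illustrative only.
    """
    if product_without_rt:
        # Amplification without reverse transcription implies a DNA template.
        return "DNA template present"
    if product_with_rt:
        # Product appears only after reverse transcription: RNA template.
        return "RNA-only template"
    return "target not detected"


# The pattern reported for the bla primers: product with RT, none without RT.
print(infer_template(product_with_rt=True, product_without_rt=False))
```

A product that appears only when reverse transcriptase is included is the pattern that supports an RNA-encoded gene.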



Example 2. Directed evolution of β-lactamase in φ6-bla carrier-state cells

2.1. P. syringae carrying φ6- but not DNA-encoded bla quickly adapt to cefotaxime
Wild-type TEM-1 β-lactamase encoded by φ6-bla hydrolyzes penicillin β-lactam
antibiotics (e.g. Amp), but cannot efficiently cleave third-generation cephalosporins such as
cefotaxime (Ctx). Since several Ctx-resistant β-lactamase variants have been reported
(Bradford, 2001; Orencia et al., 2001), we investigated whether these could be selected
using the carrier-state bacteria. HB10Y(φ6-bla) cells were plated onto LB agar containing
either 150 µg/ml Amp or 50 µg/ml Ctx and incubated at 28°C. As a control, we used
HB10Y cells transformed with a broad-range plasmid pLM254, whose bla gene is identical
to that inserted into φ6-bla (Mindich et al., 1985). Both HB10Y(φ6-bla) and
HB10Y(pLM254) grew equally well on Amp medium (Fig. 3A). On Ctx medium,
HB10Y(φ6-bla) formed slowly growing colonies of various sizes with an average
frequency of ~4 cfu (colony forming units) per 10* cfu on Amp medium; no colonies were
detected in the case of HB10Y(pLM254) by 96 h incubation (Fig. 3A). Because the
abundance of pLM254 within cells is comparable to that of M-bla (not shown), we
conclude that Ctx-resistant mutants appear considerably more often when bla is encoded
by the M segment of φ6, rather than by plasmid DNA.

2.2. HB10Y(φ6-bla) cells can gradually adapt to high cefotaxime concentrations
When the above experiment was repeated using ≥100 µg/ml Ctx, no growth was detected
even on the plates with HB10Y(φ6-bla). We therefore tested the possibility that increased
Ctx resistance can be developed by gradually increasing the concentration of Ctx and
selecting the best growers. HB10Y(φ6-bla) cells were passaged 10 times with the Ctx
concentration being elevated from 10 to 4000 µg/ml as shown in Fig. 3B. The initial
HB10Y(φ6-bla) stock was referred to as A0 and the cells obtained from different Ctx
passages were called C1, C2, ..., C10. On average, 10^-10' Amp cfu were plated onto
several petri dishes and the 20-40 largest colonies were picked and pooled after 48 h
incubation. After a brief propagation (1-2 h, 28°C) in LB medium containing Ctx at 1/4 of
the plate concentration, the cells were subjected to the next round of selection. By repeating
this procedure several times, it was possible to obtain P. syringae that were resistant to
2000-4000 µg/ml cefotaxime.
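
The passage regime can be sketched numerically as follows. The actual concentration steps are those of Fig. 3B, which is not reproduced here; the geometric spacing used below is purely an assumption for illustration, and the function names are hypothetical. Only the end points (10 and 4000 µg/ml), the number of passages (10) and the 1/4 ratio for the liquid medium are taken from the text.

```python
def geometric_schedule(start: float, end: float, steps: int) -> list[float]:
    """Return `steps` concentrations from `start` to `end`, evenly spaced on a log scale.

    Assumption: geometric spacing is illustrative; the real steps are in Fig. 3B.
    """
    if steps < 2:
        raise ValueError("need at least two passages")
    ratio = (end / start) ** (1 / (steps - 1))
    return [start * ratio ** i for i in range(steps)]


def liquid_concentration(plate_conc: float) -> float:
    """Ctx concentration for the brief liquid propagation: 1/4 of the plate value."""
    return plate_conc / 4


plate_steps = geometric_schedule(10, 4000, 10)  # passages C1..C10, in µg/ml
for number, plate in enumerate(plate_steps, start=1):
    print(f"C{number}: plate {plate:7.1f} µg/ml, liquid {liquid_concentration(plate):7.1f} µg/ml")
```

Under this assumed spacing each passage raises the Ctx concentration by roughly twofold.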

Several analyses were used to verify the presence of φ6-bla throughout the adaptation
process. First, cellular RNA was studied by agarose gel electrophoresis and RT-PCR using
bla-specific primers (Fig. 3C). M segments of increased mobility were clearly present in
all samples from C1 to C10, which correlated with the presence of the bla PCR fragment.
M-bla was relatively sparse in C1 cells as judged by the reproducibly weak RT-PCR signal
and the dominance of M-npt over M-bla on the RNA gel (lane C1). However, the amount
of M-bla in C2 to C10 was notably higher than in A0. The M-npt band disappeared from the
RNA pattern at C2.

In the second analysis, cellular proteins were separated by SDS-PAGE and subjected to
immunoblotting with polyclonal antisera against φ6 proteins P1, P2, P4, and P8,
components of φ6 nucleocapsids (Fig. 3D). Corresponding protein bands were detected in
A0 and C1 to C10. The major φ6 capsid protein, P1, was also visible on Coomassie-stained
gels.

Finally, when carrier-state bacteria were examined by electron microscopy, φ6 subviral
particles and enveloped virions were observed in the cytoplasm of A0, C1, C4, C7 and C10
cells, but not in the HB10Y control (Fig. 3E, and not shown).

Example 3. Analysis of the bla evolution results

3.1. Preparation of total RNA from carrier-state bacteria 

Bacterial cells pooled from 20-40 carrier-state colonies or pelleted from 1.5-ml liquid
cultures were resuspended in 300 µl of 50 mM Tris-HCl, pH 8.0, 100 mM EDTA, 8%
(w/v) sucrose. Lysozyme was added to 1 mg/ml and the mixture was incubated for 5 min at
room temperature. Cells were lysed by 1% SDS for 3-5 min. SDS and most of the
chromosomal DNA were precipitated by 1.5 M potassium acetate, pH 7.5, on ice. RNA was
precipitated from the supernatant fraction by the addition of 0.7 volumes of isopropanol.
The RNA pellet was dissolved in 400 µl TE (10 mM Tris-HCl, pH 8.0; 1 mM EDTA),
extracted successively with equal volumes of phenol-chloroform and chloroform, and
re-precipitated with ethanol. The pellet was washed with 70% ethanol and dissolved in 100 µl
of sterile water.

3.2. RT-PCR and cloning of the bla gene 

To obtain cDNA copies of the virus-encoded bla gene, total RNA (1 to 5 µg) from
carrier-state bacteria was mixed with 10 pmol of the reverse transcription primer
(5'-CTATCGAGCACAGCGCCAACT-3'), denatured by boiling for 1 min and chilled on ice.
Reverse transcription was performed using AMV-RT (Sigma) at 45°C for 1 h as
recommended. The bla cDNA was PCR amplified using a mixture of Pfu and Taq DNA
polymerases and the primers 5'-CCGAATTCATAAGGAAGCATATGAGTATTCA-3'
and 5'-CAACTTTTACGCTGGTGCTATACAACGACT-3'. HindIII-EcoRI cut PCR
products were ligated with a similarly treated pSU18 vector and transformed into E. coli
DH5α. Cloned bla sequences were determined using a commercial automated sequencing
facility (MWG-Biotech). Throughout the paper, amino acid numbering is according to
Ambler et al. (1991), which exceeds the physical number by 2.
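
The relationship between the two numbering schemes can be illustrated with a minimal Python sketch. It takes the constant offset of 2 stated above at face value; the function names are hypothetical, and a numbering scheme containing gaps would need a lookup table instead of a fixed offset.

```python
AMBLER_OFFSET = 2  # taken from the statement above; assumed constant here


def physical_to_ambler(position: int) -> int:
    """Convert a physical (1-based) residue number to the Ambler number."""
    if position < 1:
        raise ValueError("physical positions start at 1")
    return position + AMBLER_OFFSET


def ambler_to_physical(position: int) -> int:
    """Convert an Ambler residue number back to the physical residue number."""
    if position <= AMBLER_OFFSET:
        raise ValueError("Ambler number too small for this offset")
    return position - AMBLER_OFFSET


# Residues discussed in this Example, given in Ambler numbering.
for name, ambler in (("F24", 24), ("E104", 104), ("G238", 238)):
    print(name, "-> physical residue", ambler_to_physical(ambler))
```

Under this assumption, the Ambler positions 24, 104 and 238 correspond to physical residues 22, 102 and 236.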

3.3. Gene bla from Ctx-adapted carrier-state P. syringae cells confers Ctx resistance in E.
coli
To characterize the possible effect of cefotaxime selection on the β-lactamase gene, bla
cDNA from A0, C1-C4, C7 and C10 passages was cloned into pSU18 (E. coli plasmid
containing chloramphenicol (Cm) resistance marker) under control of the lac promoter. E.
coli DH5α was transformed with the resulting plasmid libraries and plated onto Cm
medium. Because existing cefotaxime-specific β-lactamases also confer resistance to ampicillin
(Bradford, 2001), we used plates with a low Amp concentration (50 µg/ml) to screen the
libraries for clones containing the bla insert. A sufficient amount of β-lactamase was
produced from the lac promoter without induction. Plasmids from the Amp-resistant clones
(isolated from the master Cm plates) always contained the bla inserts. Conversely, plasmids
from several randomly selected clones that were resistant to Cm but not to Amp were the
same size as the pSU18 vector.

We next examined whether E. coli containing pSU18 with bla inserts originating from
φ6-bla are also resistant to Ctx. For this purpose, ~10^ cells were transferred from colonies
grown on Cm to plates containing 5 or 10 µg/ml Ctx. Of the 50-100 colonies analyzed for
each library, 22% of the C1-derived bla clones were indeed resistant to 5 µg/ml Ctx. In the
case of C2-, C3-, C4-, C7- and C10-derived libraries, the fraction of Ctx-resistant bla
clones was 72, 81, 93, 100 and 100%, respectively, with most of the clones growing in the
presence of 5 and 10 µg/ml Ctx. No Ctx-resistant colonies were detected in the A0-derived
library.

3.4. Changes in bla sequence during adaptation to cefotaxime 

Complete bla sequences from several Ctx-resistant clones were determined for each library
(Fig. 4). Two bla alleles were found in the A0 library. One of these was the wild-type
allele, occurring at an apparent frequency of 0.22, while the other one contained a single
U→C mutation that changed F24 to S and occurred at an apparent frequency of 0.78.
Surprisingly, multiple mutations were found in bla sequences from initial Ctx passages,
one segment often containing several substitutions (up to 9 in C1; Fig. 5A). Most of the
changes were transitions (Fig. 5B).
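
For readers less familiar with the terminology, the distinction between transitions (purine to purine, or pyrimidine to pyrimidine) and transversions can be expressed as a short Python sketch. The function name is hypothetical and the code is illustrative only; it is not the procedure used to produce Fig. 5B.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "U"}


def classify_substitution(ref: str, alt: str) -> str:
    """Classify a single-base change as a transition or a transversion."""
    # Treat DNA "T" as RNA "U" so that cDNA and RNA notation both work.
    ref = ref.upper().replace("T", "U")
    alt = alt.upper().replace("T", "U")
    valid = PURINES | PYRIMIDINES
    if ref not in valid or alt not in valid:
        raise ValueError("bases must be one of A, C, G, U (or T)")
    if ref == alt:
        raise ValueError("not a substitution")
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "transition" if same_class else "transversion"


print(classify_substitution("U", "C"))  # the change underlying F24S
print(classify_substitution("G", "A"))
```

Both base changes named in this section, U→C and G→A, fall into the transition class.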

In addition to clone-specific mutations, two point mutations, F24S and a G→A substitution
leading to the G238S mutation on the protein level, were detected in most bla sequences
from C1 and subsequent passages. Beginning at C2, all sequences contained yet another
common substitution, G→A, that changed E104 to K (compare C1 and C2 in Fig. 4).
Interestingly, most clones in C4 and all clones from C7 and C10 contained only F24S,
E104K and G238S mutations, with no other mutations being detected (Fig. 4).

To ensure that the accumulation of bla mutants after the antibiotic change was a specific
effect of Ctx, we carried out a mock selection experiment. A0 cells were plated onto dishes
containing 150 µg/ml Amp and incubated for 48 h at 28°C (passage A1). dsRNA purified
from 40 pooled colonies was used to construct an RT-PCR library in E. coli as described
above. No Ctx-resistant clones were found and no other alleles were detected besides wt
and F24S (with frequencies of 0.4 and 0.6, respectively).

Since 78% of the Amp-resistant clones from the C1 library failed to grow in the presence
of Ctx, we determined bla sequences from seven Ctx-sensitive clones. All sequences
contained one or several mutations on the wt or F24S background, the overall picture being
similar to Ctx-resistant clones (not shown). The only difference was that none of the
Ctx-sensitive clones contained the G238S substitution. We conclude that the E104K and
G238S mutations were critical to enable Ctx hydrolysis. Indeed, both mutations map to the
enzyme active site and are often observed in Ctx-resistant bacteria (Bradford, 2001;
Orencia et al., 2001).
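
That each of the three recurrent amino acid changes can arise from the single-base change reported for it can be checked against the standard genetic code. The Python sketch below is illustrative only: the actual codons of the bla gene are not given in the text, so the code merely enumerates which codon pairs are consistent with each reported change. The function name is hypothetical.

```python
from itertools import product

BASES = "UCAG"
# Standard genetic code, codons ordered UUU, UUC, UUA, UUG, UCU, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def single_base_routes(aa_from, aa_to, base_from, base_to):
    """List (codon, mutant codon, codon position) triples that turn aa_from
    into aa_to through one base_from -> base_to substitution."""
    routes = []
    for codon, aa in CODON_TABLE.items():
        if aa != aa_from:
            continue
        for index, base in enumerate(codon):
            if base != base_from:
                continue
            mutant = codon[:index] + base_to + codon[index + 1:]
            if CODON_TABLE[mutant] == aa_to:
                routes.append((codon, mutant, index + 1))
    return routes


print("F->S via U->C:", single_base_routes("F", "S", "U", "C"))
print("E->K via G->A:", single_base_routes("E", "K", "G", "A"))
print("G->S via G->A:", single_base_routes("G", "S", "G", "A"))
```

Each of the three changes has exactly two consistent codon routes, so a single transition is sufficient in every case.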

The overall dynamics of the bla population adapting to Ctx is apparent from the percent
identity plots (Fig. 5C). A relatively homogeneous population in A0 (and A1)
diversified dramatically in C1 and C2. After the appropriate mutations had accumulated,
the population regained homogeneity in C4-C7. Further passages did not change the
genetic structure of the population. Importantly, the genetic heterogeneity in C2 and C3
was clearly higher than in A0, and the M-bla segment was more abundant in C2 and C3
than in A0 (Fig. 3C). Therefore, possible effects of RT-PCR derived mutations can be
excluded.
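
One simple way to quantify population homogeneity of this kind is the mean pairwise percent identity among aligned sequences of equal length. The Python sketch below is a minimal illustration under that assumption; it is not necessarily the calculation behind Fig. 5C, and the short sequences are invented toy data, not bla sequences.

```python
from itertools import combinations


def percent_identity(a: str, b: str) -> float:
    """Percent of identical positions between two aligned, equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def mean_pairwise_identity(sequences: list[str]) -> float:
    """Average percent identity over all unordered pairs of sequences."""
    pairs = list(combinations(sequences, 2))
    if not pairs:
        raise ValueError("need at least two sequences")
    return sum(percent_identity(a, b) for a, b in pairs) / len(pairs)


# Toy data (hypothetical): a homogeneous and a diversified population.
homogeneous = ["AUGAGU", "AUGAGU", "AUGAGU"]
diversified = ["AUGAGU", "AUGAGC", "ACGAGU"]
print(mean_pairwise_identity(homogeneous))
print(round(mean_pairwise_identity(diversified), 1))
```

A population that diversifies and then regains homogeneity would show this value dropping and then returning towards 100%.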

What is claimed is:

1. A method for changing nucleic acid sequence, the method comprising: 

a) providing nucleic acid target in a form that can be replicated by a polymerase
devoid of the proof-reading function;

b) contacting said replicable form of the nucleic acid target with said polymerase
under conditions sufficient for template-directed nucleic acid synthesis in a
living cell;

c) recovering nucleic acid synthesis products, whose nucleotide sequence differs
from the initial target sequence by at least one nucleotide.

2. The method according to claim 1, wherein said nucleic acid target encodes a 
polypeptide. 

3. The method according to claim 1 or 2, wherein said polymerase is an RNA-dependent
RNA polymerase.

4. The method according to claim 1 or 2, wherein said polymerase is an
RNA-dependent DNA polymerase.

5. The method according to any one of the preceding claims, wherein said nucleic acid
synthesis products are recovered after one or several rounds of selection and/or screening.

6. The method according to any one of the preceding claims, wherein the method is
specifically used for changing properties of proteins or nucleic acids in a desired manner.

7. The method according to any one of the preceding claims, wherein the nucleic acid 
target is operably linked with determinants essential for detectable replication by the 
polymerase. 

8. The method according to any one of the preceding claims, wherein the nucleic acid
target is incorporated into the genome of an RNA virus or other RNA replicon such as
RNA virus-like particle, viroid or RNA-based autonomous genetic element.

9. The method according to claim 8, wherein the RNA virus or other RNA replicon
encodes the polymerase.

10. The method according to any one of the preceding claims, wherein the polymerase and
the nucleic acid target are encoded by distinct nucleic acids. 

11. The method according to any one of the preceding claims, wherein the nucleic acid 
target is a nucleic acid having detectable biological activity, preferably selected from the 
group comprising enzymatic, regulatory and specific binding activity. 

12. The method according to any one of the preceding claims, wherein the nucleic acid 
target encodes a protein having detectable biological activity, preferably selected from the 
group comprising enzymatic, regulatory and specific binding activity. 

13. The method according to any one of the preceding claims, wherein the nucleic acid target
is RNA.

14. The method according to any one of the preceding claims, wherein the nucleic acid target
is DNA.

15. The method according to any one of the preceding claims, wherein the nucleic acid
synthesis products are RNA molecules.

16. The method according to any one of the preceding claims, wherein the nucleic acid synthesis
products are DNA molecules.

17. The method according to any one of the preceding claims, wherein the polymerase 
originates from an RNA virus or other RNA replicon. 

18. The method according to claim 17, wherein the polymerase originates from an RNA 
bacteriophage. 

19. The method according to claim 17 or 18, wherein the polymerase originates from a
member of the Cystoviridae family, preferably from a bacteriophage selected from the
group comprising φ6, φ7, φ8, φ9, φ10, φ12, φ13 and φ14, most preferably from
bacteriophage φ6.



20. The method according to any one of the preceding claims, wherein the replicable form
of the nucleic acid target is contacted with the polymerase in a prokaryotic cell, preferably
in a gram-negative bacterial cell, more preferably in a bacterial cell selected from the
group comprising Pseudomonas sp., Escherichia sp. and Salmonella sp., most preferably
in a cell of Pseudomonas syringae.

21. The method according to any one of claims 1 to 19, wherein the replicable form of the
nucleic acid target is contacted with the polymerase in a eukaryotic cell, such as
mammalian, insect, plant or yeast cell.

22. The method according to any one of the preceding claims, wherein the nucleic acid
target is delivered into the living cell using a suicide vector, preferably a DNA vector, most
preferably a DNA plasmid.

23. A method for inducing sequence-specific gene silencing effects in eukaryotic systems, 
the method comprising: 

a) providing nucleic acid target in a form replicable by an RNA-dependent RNA
polymerase in a living cell;

b) contacting said replicable form of the nucleic acid target with said polymerase
under conditions sufficient for template-directed RNA synthesis, one of the
reaction products being necessarily double-stranded (ds) RNA;

c) recovering said dsRNA products in a sufficiently pure form and optionally
modifying said products for optimal performance;

d) using said pure, optionally modified, dsRNA products to induce sequence- 
specific gene-silencing effects in eukaryotic systems, such as organisms, cells or 
cell-free extracts. 

24. The method according to claim 23, wherein the RNA-dependent RNA polymerase
originates from a dsRNA virus or a dsRNA replicon.

25. The method according to claim 23 or 24, wherein the RNA-dependent RNA
polymerase originates from the Cystoviridae family, preferably from a bacteriophage
selected from the group comprising φ6, φ7, φ8, φ9, φ10, φ11, φ12, φ13 and φ14, most
preferably from bacteriophage φ6.

26. The method according to any one of claims 23 to 25, wherein the living cell is a
prokaryotic cell, preferably a gram-negative bacterial cell, more preferably the bacterial
cell is selected from the group comprising Pseudomonas sp., Escherichia sp. and
Salmonella sp., most preferably the bacterium is Pseudomonas syringae.

27. The method according to any one of claims 23 to 26, wherein the optional step of
modifying for optimal performance is fragmenting dsRNA with dsRNA-specific
ribonucleases, preferably RNase III, Dicer, or derivatives thereof.

28. The method according to any one of claims 23 to 27, wherein the nucleic acid target is
provided in a suicide vector.

29. The method according to any one of claims 23 to 28, wherein the dsRNA products are
used to induce sequence-specific gene-silencing effects in invertebrate animal systems,
preferably of insect or nematode origin, most preferably from Drosophila melanogaster or
Caenorhabditis elegans.

30. The method according to any one of claims 23 to 28, wherein the dsRNA products are
used to induce sequence-specific gene-silencing effects in vertebrate animal systems,
preferably of mammalian origin, most preferably of human or mouse origin.

31. A method for constructing a recombinant RNA virus, wherein a suicide vector,
comprising a target nucleic acid operably linked with sequences sufficient for detectable
replication by the viral replication apparatus, is used to incorporate said nucleic acid target
into the genome of said RNA virus.

32. The method according to claim 31, wherein the RNA virus belongs to the family of
Cystoviridae.

33. A suicide vector, which comprises a beta-lactamase gene operably linked with
determinants essential for detectable replication by the RNA-synthesis apparatus of a
Cystoviridae member, preferably bacteriophage φ6.

34. A genetically modified cystovirus, which comprises a beta-lactamase gene conferring 
resistance to one or several antibiotics of the penicillin group, preferably ampicillin. 

35. A genetically modified cystovirus, which comprises a beta-lactamase gene conferring 
resistance to one or several antibiotics of the cephalosporin group, preferably cefotaxime. 

36. Carrier-state cells comprising the cystovirus of claim 34 or 35.

37. The carrier-state cells according to claim 36, wherein the cells are bacteria,
preferably gram-negative bacteria, more preferably bacteria selected from the group
comprising Pseudomonas sp., Escherichia sp. and Salmonella sp., most preferably
Pseudomonas syringae.

38. A kit for changing nucleic acid or protein sequences, the kit comprising: 

a) a vector for transient expression of target nucleic acid in preselected cells that
either are carrier-state or can be transformed into carrier state and/or

b) a genetically modified virus into which the target nucleic acid can be introduced;
and/or 

c) cells that either are carrier-state or can be transformed into carrier state. 

39. A kit for mass production of dsRNA, the kit comprising:

a) a vector for transient expression of target nucleic acid in preselected cells that
either are carrier-state or can be transformed into carrier state and/or

b) a genetically modified virus into which the target nucleic acid can be introduced;
and/or

c) cells that either are carrier-state or can be transformed into carrier state.




(57) Abstract 

A novel strategy for directed evolution of nucleic acids and proteins is described, in
which target nucleic acid is copied by a polymerase devoid of proofreading
function. Advantageous mutations generated during this process are recovered
using an appropriate selection or screening procedure. The invention provides fast,
inexpensive and non-laborious methods for practicing said strategy, which are
utilized either separately or in combination with other methods for engineering
biopolymers with desired properties. The invention furthermore provides kits for
directed evolution according to the described methodology. In an aspect, the
invention discloses methods and kits for producing nucleic acids encoding proteins
with desired properties. In yet another aspect, the invention deals with methods and
kits for propagating nucleic acid targets in the form of double-stranded RNA,
suitable for triggering RNA silencing effects.